Serveur d'exploration MERS

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

kClust: fast and sensitive clustering of large protein sequence databases

Identifieur interne : 001E65 ( Main/Exploration ); précédent : 001E64; suivant : 001E66

kClust: fast and sensitive clustering of large protein sequence databases

Auteurs : Maria Hauser [Allemagne] ; Christian E. Mayer [Allemagne, Suisse] ; Johannes Söding [Allemagne]

Source :

RBID : PMC:3843501

Descripteurs français

English descriptors

Abstract

Background

Fueled by rapid progress in high-throughput sequencing, the size of public sequence databases doubles every two years. Searching the ever larger and more redundant databases is getting increasingly inefficient. Clustering can help to organize sequences into homologous and functionally similar groups and can improve the speed, sensitivity, and readability of homology searches. However, because the clustering time is quadratic in the number of sequences, standard sequence search methods are becoming impracticable.

Results

Here we present a method to cluster large protein sequence databases such as UniProt within days down to 20%–30% maximum pairwise sequence identity. kClust owes its speed and sensitivity to an alignment-free prefilter that calculates the cumulative score of all similar 6-mers between pairs of sequences, and to a dynamic programming algorithm that operates on pairs of similar 4-mers. To increase sensitivity further, kClust can run in profile-sequence comparison mode, with profiles computed from the clusters of a previous kClust iteration. kClust is two to three orders of magnitude faster than clustering based on NCBI BLAST, and on multidomain sequences of 20%–30% maximum pairwise sequence identity it achieves comparable sensitivity and a lower false discovery rate. It also compares favorably to CD-HIT and UCLUST in terms of false discovery rate, sensitivity, and speed.

Conclusions

kClust fills the need for a fast, sensitive, and accurate tool to cluster large protein sequence databases to below 30% sequence identity. kClust is freely available under GPL at http://toolkit.lmb.uni-muenchen.de/pub/kClust/.


Url:
DOI: 10.1186/1471-2105-14-248
PubMed: 23945046
PubMed Central: 3843501


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en">kClust: fast and sensitive clustering of large protein sequence databases</title>
<author>
<name sortKey="Hauser, Maria" sort="Hauser, Maria" uniqKey="Hauser M" first="Maria" last="Hauser">Maria Hauser</name>
<affiliation wicri:level="4">
<nlm:aff id="I1">Gene Center and Center for Integrated Protein Science (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Str. 25, Munich 81377, Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Gene Center and Center for Integrated Protein Science (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Str. 25, Munich 81377</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Bavière</region>
<region type="district" nuts="2">District de Haute-Bavière</region>
<settlement type="city">Munich</settlement>
</placeName>
<orgName type="university">Université Louis-et-Maximilien de Munich</orgName>
</affiliation>
</author>
<author>
<name sortKey="Mayer, Christian E" sort="Mayer, Christian E" uniqKey="Mayer C" first="Christian E" last="Mayer">Christian E. Mayer</name>
<affiliation wicri:level="3">
<nlm:aff id="I2">Department for Protein Evolution, Max Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen 72076, Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Department for Protein Evolution, Max Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen 72076</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Bade-Wurtemberg</region>
<region type="district" nuts="2">District de Tübingen</region>
<settlement type="city">Tübingen</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I3">Present address: D-BSSE, ETH Zuerich, Mattenstr. 26, Basel 4058, Switzerland</nlm:aff>
<country xml:lang="fr">Suisse</country>
<wicri:regionArea>Present address: D-BSSE, ETH Zuerich, Mattenstr. 26, Basel 4058</wicri:regionArea>
<wicri:noRegion>Basel 4058</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Soding, Johannes" sort="Soding, Johannes" uniqKey="Soding J" first="Johannes" last="Söding">Johannes Söding</name>
<affiliation wicri:level="4">
<nlm:aff id="I1">Gene Center and Center for Integrated Protein Science (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Str. 25, Munich 81377, Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Gene Center and Center for Integrated Protein Science (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Str. 25, Munich 81377</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Bavière</region>
<region type="district" nuts="2">District de Haute-Bavière</region>
<settlement type="city">Munich</settlement>
</placeName>
<orgName type="university">Université Louis-et-Maximilien de Munich</orgName>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">PMC</idno>
<idno type="pmid">23945046</idno>
<idno type="pmc">3843501</idno>
<idno type="url">http://www.ncbi.nlm.nih.gov/pmc/articles/PMC3843501</idno>
<idno type="RBID">PMC:3843501</idno>
<idno type="doi">10.1186/1471-2105-14-248</idno>
<date when="2013">2013</date>
<idno type="wicri:Area/Pmc/Corpus">000934</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Corpus" wicri:corpus="PMC">000934</idno>
<idno type="wicri:Area/Pmc/Curation">000934</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Curation">000934</idno>
<idno type="wicri:Area/Pmc/Checkpoint">001154</idno>
<idno type="wicri:explorRef" wicri:stream="Pmc" wicri:step="Checkpoint">001154</idno>
<idno type="wicri:source">PubMed</idno>
<idno type="RBID">pubmed:23945046</idno>
<idno type="wicri:Area/PubMed/Corpus">001C19</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Corpus" wicri:corpus="PubMed">001C19</idno>
<idno type="wicri:Area/PubMed/Curation">001C19</idno>
<idno type="wicri:explorRef" wicri:stream="PubMed" wicri:step="Curation">001C19</idno>
<idno type="wicri:Area/PubMed/Checkpoint">001A23</idno>
<idno type="wicri:explorRef" wicri:stream="Checkpoint" wicri:step="PubMed">001A23</idno>
<idno type="wicri:Area/Ncbi/Merge">000B18</idno>
<idno type="wicri:Area/Ncbi/Curation">000B18</idno>
<idno type="wicri:Area/Ncbi/Checkpoint">000B18</idno>
<idno type="wicri:Area/Main/Merge">001E80</idno>
<idno type="wicri:Area/Main/Curation">001E65</idno>
<idno type="wicri:Area/Main/Exploration">001E65</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a" type="main">kClust: fast and sensitive clustering of large protein sequence databases</title>
<author>
<name sortKey="Hauser, Maria" sort="Hauser, Maria" uniqKey="Hauser M" first="Maria" last="Hauser">Maria Hauser</name>
<affiliation wicri:level="4">
<nlm:aff id="I1">Gene Center and Center for Integrated Protein Science (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Str. 25, Munich 81377, Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Gene Center and Center for Integrated Protein Science (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Str. 25, Munich 81377</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Bavière</region>
<region type="district" nuts="2">District de Haute-Bavière</region>
<settlement type="city">Munich</settlement>
</placeName>
<orgName type="university">Université Louis-et-Maximilien de Munich</orgName>
</affiliation>
</author>
<author>
<name sortKey="Mayer, Christian E" sort="Mayer, Christian E" uniqKey="Mayer C" first="Christian E" last="Mayer">Christian E. Mayer</name>
<affiliation wicri:level="3">
<nlm:aff id="I2">Department for Protein Evolution, Max Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen 72076, Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Department for Protein Evolution, Max Planck Institute for Developmental Biology, Spemannstr. 35, Tübingen 72076</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Bade-Wurtemberg</region>
<region type="district" nuts="2">District de Tübingen</region>
<settlement type="city">Tübingen</settlement>
</placeName>
</affiliation>
<affiliation wicri:level="1">
<nlm:aff id="I3">Present address: D-BSSE, ETH Zuerich, Mattenstr. 26, Basel 4058, Switzerland</nlm:aff>
<country xml:lang="fr">Suisse</country>
<wicri:regionArea>Present address: D-BSSE, ETH Zuerich, Mattenstr. 26, Basel 4058</wicri:regionArea>
<wicri:noRegion>Basel 4058</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Soding, Johannes" sort="Soding, Johannes" uniqKey="Soding J" first="Johannes" last="Söding">Johannes Söding</name>
<affiliation wicri:level="4">
<nlm:aff id="I1">Gene Center and Center for Integrated Protein Science (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Str. 25, Munich 81377, Germany</nlm:aff>
<country xml:lang="fr">Allemagne</country>
<wicri:regionArea>Gene Center and Center for Integrated Protein Science (CIPSM), Ludwig-Maximilians-Universität München, Feodor-Lynen-Str. 25, Munich 81377</wicri:regionArea>
<placeName>
<region type="land" nuts="1">Bavière</region>
<region type="district" nuts="2">District de Haute-Bavière</region>
<settlement type="city">Munich</settlement>
</placeName>
<orgName type="university">Université Louis-et-Maximilien de Munich</orgName>
</affiliation>
</author>
</analytic>
<series>
<title level="j">BMC Bioinformatics</title>
<idno type="eISSN">1471-2105</idno>
<imprint>
<date when="2013">2013</date>
</imprint>
</series>
</biblStruct>
</sourceDesc>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Algorithms</term>
<term>Cluster Analysis</term>
<term>Databases, Factual</term>
<term>Databases, Protein</term>
<term>Sequence Analysis, Protein (methods)</term>
<term>Software</term>
</keywords>
<keywords scheme="KwdFr" xml:lang="fr">
<term>Algorithmes</term>
<term>Analyse de regroupements</term>
<term>Analyse de séquence de protéine ()</term>
<term>Bases de données de protéines</term>
<term>Bases de données factuelles</term>
<term>Logiciel</term>
</keywords>
<keywords scheme="MESH" qualifier="methods" xml:lang="en">
<term>Sequence Analysis, Protein</term>
</keywords>
<keywords scheme="MESH" xml:lang="en">
<term>Algorithms</term>
<term>Cluster Analysis</term>
<term>Databases, Factual</term>
<term>Databases, Protein</term>
<term>Software</term>
</keywords>
<keywords scheme="MESH" xml:lang="fr">
<term>Algorithmes</term>
<term>Analyse de regroupements</term>
<term>Analyse de séquence de protéine</term>
<term>Bases de données de protéines</term>
<term>Bases de données factuelles</term>
<term>Logiciel</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">
<sec>
<title>Background</title>
<p>Fueled by rapid progress in high-throughput sequencing, the size of public sequence databases doubles every two years. Searching the ever larger and more redundant databases is getting increasingly inefficient. Clustering can help to organize sequences into homologous and functionally similar groups and can improve the speed, sensitivity, and readability of homology searches. However, because the clustering time is quadratic in the number of sequences, standard sequence search methods are becoming impracticable.</p>
</sec>
<sec>
<title>Results</title>
<p>Here we present a method to cluster large protein sequence databases such as UniProt within days down to 20%–30% maximum pairwise sequence identity. kClust owes its speed and sensitivity to an alignment-free prefilter that calculates the cumulative score of all similar 6-mers between pairs of sequences, and to a dynamic programming algorithm that operates on pairs of similar 4-mers. To increase sensitivity further, kClust can run in profile-sequence comparison mode, with profiles computed from the clusters of a previous kClust iteration. kClust is two to three orders of magnitude faster than clustering based on NCBI BLAST, and on multidomain sequences of 20%–30% maximum pairwise sequence identity it achieves comparable sensitivity and a lower false discovery rate. It also compares favorably to CD-HIT and UCLUST in terms of false discovery rate, sensitivity, and speed.</p>
</sec>
<sec>
<title>Conclusions</title>
<p>kClust fills the need for a fast, sensitive, and accurate tool to cluster large protein sequence databases to below 30% sequence identity. kClust is freely available under GPL at
<ext-link ext-link-type="uri" xlink:href="http://toolkit.lmb.uni-muenchen.de/pub/kClust/">http://toolkit.lmb.uni-muenchen.de/pub/kClust/</ext-link>
.</p>
</sec>
</div>
</front>
<back>
<div1 type="bibliography">
<listBibl>
<biblStruct>
<analytic>
<author>
<name sortKey="Chubb, D" uniqKey="Chubb D">D Chubb</name>
</author>
<author>
<name sortKey="Jefferys, Br" uniqKey="Jefferys B">BR Jefferys</name>
</author>
<author>
<name sortKey="Sternberg, Mje" uniqKey="Sternberg M">MJE Sternberg</name>
</author>
<author>
<name sortKey="Kelley, La" uniqKey="Kelley L">LA Kelley</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, W" uniqKey="Li W">W Li</name>
</author>
<author>
<name sortKey="Jaroszewski, L" uniqKey="Jaroszewski L">L Jaroszewski</name>
</author>
<author>
<name sortKey="Godzik, A" uniqKey="Godzik A">A Godzik</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Park, J" uniqKey="Park J">J Park</name>
</author>
<author>
<name sortKey="Holm, L" uniqKey="Holm L">L Holm</name>
</author>
<author>
<name sortKey="Heger, A" uniqKey="Heger A">A Heger</name>
</author>
<author>
<name sortKey="Chothia, C" uniqKey="Chothia C">C Chothia</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Suzek, B" uniqKey="Suzek B">B Suzek</name>
</author>
<author>
<name sortKey="Huang, H" uniqKey="Huang H">H Huang</name>
</author>
<author>
<name sortKey="Mcgarvey, P" uniqKey="Mcgarvey P">P McGarvey</name>
</author>
<author>
<name sortKey="Mazumder, R" uniqKey="Mazumder R">R Mazumder</name>
</author>
<author>
<name sortKey="Wu, C" uniqKey="Wu C">C Wu</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rusch, Db" uniqKey="Rusch D">DB Rusch</name>
</author>
<author>
<name sortKey="Halpern, Al" uniqKey="Halpern A">AL Halpern</name>
</author>
<author>
<name sortKey="Sutton, G" uniqKey="Sutton G">G Sutton</name>
</author>
<author>
<name sortKey="Heidelberg, Kb" uniqKey="Heidelberg K">KB Heidelberg</name>
</author>
<author>
<name sortKey="Williamson, S" uniqKey="Williamson S">S Williamson</name>
</author>
<author>
<name sortKey="Yooseph, S" uniqKey="Yooseph S">S Yooseph</name>
</author>
<author>
<name sortKey="Wu, D" uniqKey="Wu D">D Wu</name>
</author>
<author>
<name sortKey="Eisen, Ja" uniqKey="Eisen J">JA Eisen</name>
</author>
<author>
<name sortKey="Hoffman, Jm" uniqKey="Hoffman J">JM Hoffman</name>
</author>
<author>
<name sortKey="Remington, K" uniqKey="Remington K">K Remington</name>
</author>
<author>
<name sortKey="Beeson, K" uniqKey="Beeson K">K Beeson</name>
</author>
<author>
<name sortKey="Tran, B" uniqKey="Tran B">B Tran</name>
</author>
<author>
<name sortKey="Baden Tillson, H" uniqKey="Baden Tillson H">H Baden-Tillson</name>
</author>
<author>
<name sortKey="Stewart, C" uniqKey="Stewart C">C Stewart</name>
</author>
<author>
<name sortKey="Thorpe, J" uniqKey="Thorpe J">J Thorpe</name>
</author>
<author>
<name sortKey="Freeman, J" uniqKey="Freeman J">J Freeman</name>
</author>
<author>
<name sortKey="Andrews Pfannkoch, C" uniqKey="Andrews Pfannkoch C">C Andrews-Pfannkoch</name>
</author>
<author>
<name sortKey="Venter, Je" uniqKey="Venter J">JE Venter</name>
</author>
<author>
<name sortKey="Li, K" uniqKey="Li K">K Li</name>
</author>
<author>
<name sortKey="Kravitz, S" uniqKey="Kravitz S">S Kravitz</name>
</author>
<author>
<name sortKey="Heidelberg, Jf" uniqKey="Heidelberg J">JF Heidelberg</name>
</author>
<author>
<name sortKey="Utterback, T" uniqKey="Utterback T">T Utterback</name>
</author>
<author>
<name sortKey="Rogers, Y" uniqKey="Rogers Y">Y Rogers</name>
</author>
<author>
<name sortKey="Falc N, Li" uniqKey="Falc N L">LI Falcón</name>
</author>
<author>
<name sortKey="Souza, V" uniqKey="Souza V">V Souza</name>
</author>
<author>
<name sortKey="Bonilla Rosso, G" uniqKey="Bonilla Rosso G">G Bonilla-Rosso</name>
</author>
<author>
<name sortKey="Eguiarte, Le" uniqKey="Eguiarte L">LE Eguiarte</name>
</author>
<author>
<name sortKey="Karl, Dm" uniqKey="Karl D">DM Karl</name>
</author>
<author>
<name sortKey="Sathyendranath, S" uniqKey="Sathyendranath S">S Sathyendranath</name>
</author>
</analytic>
</biblStruct>
<biblStruct></biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Qin, J" uniqKey="Qin J">J Qin</name>
</author>
<author>
<name sortKey="Li, R" uniqKey="Li R">R Li</name>
</author>
<author>
<name sortKey="Raes, J" uniqKey="Raes J">J Raes</name>
</author>
<author>
<name sortKey="Arumugam, M" uniqKey="Arumugam M">M Arumugam</name>
</author>
<author>
<name sortKey="Burgdorf, Ks" uniqKey="Burgdorf K">KS Burgdorf</name>
</author>
<author>
<name sortKey="Manichanh, C" uniqKey="Manichanh C">C Manichanh</name>
</author>
<author>
<name sortKey="Nielsen, T" uniqKey="Nielsen T">T Nielsen</name>
</author>
<author>
<name sortKey="Pons, N" uniqKey="Pons N">N Pons</name>
</author>
<author>
<name sortKey="Levenez, F" uniqKey="Levenez F">F Levenez</name>
</author>
<author>
<name sortKey="Yamada, T" uniqKey="Yamada T">T Yamada</name>
</author>
<author>
<name sortKey="Mende, Dr" uniqKey="Mende D">DR Mende</name>
</author>
<author>
<name sortKey="Li, J" uniqKey="Li J">J Li</name>
</author>
<author>
<name sortKey="Xu, J" uniqKey="Xu J">J Xu</name>
</author>
<author>
<name sortKey="Li, S" uniqKey="Li S">S Li</name>
</author>
<author>
<name sortKey="Li, D" uniqKey="Li D">D Li</name>
</author>
<author>
<name sortKey="Cao, J" uniqKey="Cao J">J Cao</name>
</author>
<author>
<name sortKey="Wang, B" uniqKey="Wang B">B Wang</name>
</author>
<author>
<name sortKey="Liang, H" uniqKey="Liang H">H Liang</name>
</author>
<author>
<name sortKey="Zheng, H" uniqKey="Zheng H">H Zheng</name>
</author>
<author>
<name sortKey="Xie, Y" uniqKey="Xie Y">Y Xie</name>
</author>
<author>
<name sortKey="Tap, J" uniqKey="Tap J">J Tap</name>
</author>
<author>
<name sortKey="Lepage, P" uniqKey="Lepage P">P Lepage</name>
</author>
<author>
<name sortKey="Bertalan, M" uniqKey="Bertalan M">M Bertalan</name>
</author>
<author>
<name sortKey="Batto, Jm" uniqKey="Batto J">JM Batto</name>
</author>
<author>
<name sortKey="Hansen, T" uniqKey="Hansen T">T Hansen</name>
</author>
<author>
<name sortKey="Le Paslier, D" uniqKey="Le Paslier D">D Le Paslier</name>
</author>
<author>
<name sortKey="Linneberg, A" uniqKey="Linneberg A">A Linneberg</name>
</author>
<author>
<name sortKey="Nielsen, Hb" uniqKey="Nielsen H">HB Nielsen</name>
</author>
<author>
<name sortKey="Pelletier, E" uniqKey="Pelletier E">E Pelletier</name>
</author>
<author>
<name sortKey="Renault, P" uniqKey="Renault P">P Renault</name>
</author>
<author>
<name sortKey="Et, Al" uniqKey="Et A">al et</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Remmert, M" uniqKey="Remmert M">M Remmert</name>
</author>
<author>
<name sortKey="Biegert, A" uniqKey="Biegert A">A Biegert</name>
</author>
<author>
<name sortKey="Hauser, A" uniqKey="Hauser A">A Hauser</name>
</author>
<author>
<name sortKey="Soding, J" uniqKey="Soding J">J Söding</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Altschul, Sf" uniqKey="Altschul S">SF Altschul</name>
</author>
<author>
<name sortKey="Madden, Tl" uniqKey="Madden T">TL Madden</name>
</author>
<author>
<name sortKey="Sch Ffer, Aa" uniqKey="Sch Ffer A">AA Schäffer</name>
</author>
<author>
<name sortKey="Zhang, J" uniqKey="Zhang J">J Zhang</name>
</author>
<author>
<name sortKey="Zhang, Z" uniqKey="Zhang Z">Z Zhang</name>
</author>
<author>
<name sortKey="Miller, W" uniqKey="Miller W">W Miller</name>
</author>
<author>
<name sortKey="Lipman, Dj" uniqKey="Lipman D">DJ Lipman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pearson, W" uniqKey="Pearson W">W Pearson</name>
</author>
<author>
<name sortKey="Lipman, D" uniqKey="Lipman D">D Lipman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Altschul, Sf" uniqKey="Altschul S">SF Altschul</name>
</author>
<author>
<name sortKey="Gish, W" uniqKey="Gish W">W Gish</name>
</author>
<author>
<name sortKey="Miller, W" uniqKey="Miller W">W Miller</name>
</author>
<author>
<name sortKey="Myers, Ew" uniqKey="Myers E">EW Myers</name>
</author>
<author>
<name sortKey="Lipman, Dj" uniqKey="Lipman D">DJ Lipman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Yona, G" uniqKey="Yona G">G Yona</name>
</author>
<author>
<name sortKey="Linial, N" uniqKey="Linial N">N Linial</name>
</author>
<author>
<name sortKey="Linial, M" uniqKey="Linial M">M Linial</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Krause, A" uniqKey="Krause A">A Krause</name>
</author>
<author>
<name sortKey="Stoye, J" uniqKey="Stoye J">J Stoye</name>
</author>
<author>
<name sortKey="Vingron, M" uniqKey="Vingron M">M Vingron</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Miele, V" uniqKey="Miele V">V Miele</name>
</author>
<author>
<name sortKey="Penel, S" uniqKey="Penel S">S Penel</name>
</author>
<author>
<name sortKey="Duret, L" uniqKey="Duret L">L Duret</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rappoport, N" uniqKey="Rappoport N">N Rappoport</name>
</author>
<author>
<name sortKey="Karsenty, S" uniqKey="Karsenty S">S Karsenty</name>
</author>
<author>
<name sortKey="Stern, A" uniqKey="Stern A">A Stern</name>
</author>
<author>
<name sortKey="Linial, N" uniqKey="Linial N">N Linial</name>
</author>
<author>
<name sortKey="Linial, M" uniqKey="Linial M">M Linial</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Remm, M" uniqKey="Remm M">M Remm</name>
</author>
<author>
<name sortKey="Storm, Ce" uniqKey="Storm C">CE Storm</name>
</author>
<author>
<name sortKey="Sonnhammer, El" uniqKey="Sonnhammer E">EL Sonnhammer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Enright, Aj" uniqKey="Enright A">AJ Enright</name>
</author>
<author>
<name sortKey="Van Dongen, S" uniqKey="Van Dongen S">S Van Dongen</name>
</author>
<author>
<name sortKey="Ouzounis, Ca" uniqKey="Ouzounis C">CA Ouzounis</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Tatusov, Rl" uniqKey="Tatusov R">RL Tatusov</name>
</author>
<author>
<name sortKey="Fedorova, Nd" uniqKey="Fedorova N">ND Fedorova</name>
</author>
<author>
<name sortKey="Jackson, Jd" uniqKey="Jackson J">JD Jackson</name>
</author>
<author>
<name sortKey="Jacobs, Ar" uniqKey="Jacobs A">AR Jacobs</name>
</author>
<author>
<name sortKey="Kiryutin, B" uniqKey="Kiryutin B">B Kiryutin</name>
</author>
<author>
<name sortKey="Koonin, Ev" uniqKey="Koonin E">EV Koonin</name>
</author>
<author>
<name sortKey="Krylov, Dm" uniqKey="Krylov D">DM Krylov</name>
</author>
<author>
<name sortKey="Mazumder, R" uniqKey="Mazumder R">R Mazumder</name>
</author>
<author>
<name sortKey="Mekhedov, Sl" uniqKey="Mekhedov S">SL Mekhedov</name>
</author>
<author>
<name sortKey="Nikolskaya, An" uniqKey="Nikolskaya A">AN Nikolskaya</name>
</author>
<author>
<name sortKey="Rao, Bs" uniqKey="Rao B">BS Rao</name>
</author>
<author>
<name sortKey="Smirnov, S" uniqKey="Smirnov S">S Smirnov</name>
</author>
<author>
<name sortKey="Sverdlov, Av" uniqKey="Sverdlov A">AV Sverdlov</name>
</author>
<author>
<name sortKey="Vasudevan, S" uniqKey="Vasudevan S">S Vasudevan</name>
</author>
<author>
<name sortKey="Wolf, Yi" uniqKey="Wolf Y">YI Wolf</name>
</author>
<author>
<name sortKey="Yin, Jj" uniqKey="Yin J">JJ Yin</name>
</author>
<author>
<name sortKey="Natale, Da" uniqKey="Natale D">DA Natale</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, L" uniqKey="Li L">L Li</name>
</author>
<author>
<name sortKey="Stoeckert, Cj" uniqKey="Stoeckert C">CJ Stoeckert</name>
</author>
<author>
<name sortKey="Roos, Ds" uniqKey="Roos D">DS Roos</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Alexeyenko, A" uniqKey="Alexeyenko A">A Alexeyenko</name>
</author>
<author>
<name sortKey="Tamas, I" uniqKey="Tamas I">I Tamas</name>
</author>
<author>
<name sortKey="Liu, G" uniqKey="Liu G">G Liu</name>
</author>
<author>
<name sortKey="Sonnhammer, El" uniqKey="Sonnhammer E">EL Sonnhammer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Chen, Tw" uniqKey="Chen T">TW Chen</name>
</author>
<author>
<name sortKey="Wu, Th" uniqKey="Wu T">TH Wu</name>
</author>
<author>
<name sortKey="Ng, Wv" uniqKey="Ng W">WV Ng</name>
</author>
<author>
<name sortKey="Lin, Wc" uniqKey="Lin W">WC Lin</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Powell, S" uniqKey="Powell S">S Powell</name>
</author>
<author>
<name sortKey="Szklarczyk, D" uniqKey="Szklarczyk D">D Szklarczyk</name>
</author>
<author>
<name sortKey="Trachana, K" uniqKey="Trachana K">K Trachana</name>
</author>
<author>
<name sortKey="Roth, A" uniqKey="Roth A">A Roth</name>
</author>
<author>
<name sortKey="Kuhn, M" uniqKey="Kuhn M">M Kuhn</name>
</author>
<author>
<name sortKey="Muller, J" uniqKey="Muller J">J Muller</name>
</author>
<author>
<name sortKey="Arnold, R" uniqKey="Arnold R">R Arnold</name>
</author>
<author>
<name sortKey="Rattei, T" uniqKey="Rattei T">T Rattei</name>
</author>
<author>
<name sortKey="Letunic, I" uniqKey="Letunic I">I Letunic</name>
</author>
<author>
<name sortKey="Doerks, T" uniqKey="Doerks T">T Doerks</name>
</author>
<author>
<name sortKey="Jensen, Lj" uniqKey="Jensen L">LJ Jensen</name>
</author>
<author>
<name sortKey="Von Mering, C" uniqKey="Von Mering C">C von Mering</name>
</author>
<author>
<name sortKey="Bork, P" uniqKey="Bork P">P Bork</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Pearson, Wr" uniqKey="Pearson W">WR Pearson</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Rattei, T" uniqKey="Rattei T">T Rattei</name>
</author>
<author>
<name sortKey="Tischler, P" uniqKey="Tischler P">P Tischler</name>
</author>
<author>
<name sortKey="Gotz, S" uniqKey="Gotz S">S Götz</name>
</author>
<author>
<name sortKey="Jehl, Ma" uniqKey="Jehl M">MA Jehl</name>
</author>
<author>
<name sortKey="Hoser, J" uniqKey="Hoser J">J Hoser</name>
</author>
<author>
<name sortKey="Arnold, R" uniqKey="Arnold R">R Arnold</name>
</author>
<author>
<name sortKey="Conesa, A" uniqKey="Conesa A">A Conesa</name>
</author>
<author>
<name sortKey="Mewes, Hw" uniqKey="Mewes H">HW Mewes</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, W" uniqKey="Li W">W Li</name>
</author>
<author>
<name sortKey="Jaroszewski, L" uniqKey="Jaroszewski L">L Jaroszewski</name>
</author>
<author>
<name sortKey="Godzik, A" uniqKey="Godzik A">A Godzik</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Li, W" uniqKey="Li W">W Li</name>
</author>
<author>
<name sortKey="Godzik, A" uniqKey="Godzik A">A Godzik</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Fu, L" uniqKey="Fu L">L Fu</name>
</author>
<author>
<name sortKey="Niu, B" uniqKey="Niu B">B Niu</name>
</author>
<author>
<name sortKey="Zhu, Z" uniqKey="Zhu Z">Z Zhu</name>
</author>
<author>
<name sortKey="Wu, S" uniqKey="Wu S">S Wu</name>
</author>
<author>
<name sortKey="Li, W" uniqKey="Li W">W Li</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Edgar, Rc" uniqKey="Edgar R">RC Edgar</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hobohm, U" uniqKey="Hobohm U">U Hobohm</name>
</author>
<author>
<name sortKey="Scharf, M" uniqKey="Scharf M">M Scharf</name>
</author>
<author>
<name sortKey="Schneider, R" uniqKey="Schneider R">R Schneider</name>
</author>
<author>
<name sortKey="Sander, C" uniqKey="Sander C">C Sander</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Ma, B" uniqKey="Ma B">B Ma</name>
</author>
<author>
<name sortKey="Tromp, J" uniqKey="Tromp J">J Tromp</name>
</author>
<author>
<name sortKey="Li, M" uniqKey="Li M">M Li</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Mayer, Ce" uniqKey="Mayer C">CE Mayer</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Przybylski, D" uniqKey="Przybylski D">D Przybylski</name>
</author>
<author>
<name sortKey="Rost, B" uniqKey="Rost B">B Rost</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Finn, Rd" uniqKey="Finn R">RD Finn</name>
</author>
<author>
<name sortKey="Mistry, J" uniqKey="Mistry J">J Mistry</name>
</author>
<author>
<name sortKey="Tate, J" uniqKey="Tate J">J Tate</name>
</author>
<author>
<name sortKey="Coggill, P" uniqKey="Coggill P">P Coggill</name>
</author>
<author>
<name sortKey="Heger, A" uniqKey="Heger A">A Heger</name>
</author>
<author>
<name sortKey="Pollington, Ja" uniqKey="Pollington J">JA Pollington</name>
</author>
<author>
<name sortKey="Gavin, Ol" uniqKey="Gavin O">OL Gavin</name>
</author>
<author>
<name sortKey="Gunasekaran, P" uniqKey="Gunasekaran P">P Gunasekaran</name>
</author>
<author>
<name sortKey="Ceric, G" uniqKey="Ceric G">G Ceric</name>
</author>
<author>
<name sortKey="Forslund, K" uniqKey="Forslund K">K Forslund</name>
</author>
<author>
<name sortKey="Holm, L" uniqKey="Holm L">L Holm</name>
</author>
<author>
<name sortKey="Sonnhammer, El" uniqKey="Sonnhammer E">EL Sonnhammer</name>
</author>
<author>
<name sortKey="Eddy, Sr" uniqKey="Eddy S">SR Eddy</name>
</author>
<author>
<name sortKey="Bateman, A" uniqKey="Bateman A">A Bateman</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Lo Conte, L" uniqKey="Lo Conte L">L Lo Conte</name>
</author>
<author>
<name sortKey="Ailey, B" uniqKey="Ailey B">B Ailey</name>
</author>
<author>
<name sortKey="Hubbard, Tj" uniqKey="Hubbard T">TJ Hubbard</name>
</author>
<author>
<name sortKey="Brenner, Se" uniqKey="Brenner S">SE Brenner</name>
</author>
<author>
<name sortKey="Murzin, Ag" uniqKey="Murzin A">AG Murzin</name>
</author>
<author>
<name sortKey="Chothia, C" uniqKey="Chothia C">C Chothia</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Apweiler, R" uniqKey="Apweiler R">R Apweiler</name>
</author>
<author>
<name sortKey="Bairoch, A" uniqKey="Bairoch A">A Bairoch</name>
</author>
<author>
<name sortKey="Wu, Ch" uniqKey="Wu C">CH Wu</name>
</author>
<author>
<name sortKey="Barker, Wc" uniqKey="Barker W">WC Barker</name>
</author>
<author>
<name sortKey="Boeckmann, B" uniqKey="Boeckmann B">B Boeckmann</name>
</author>
<author>
<name sortKey="Ferro, S" uniqKey="Ferro S">S Ferro</name>
</author>
<author>
<name sortKey="Gasteiger, E" uniqKey="Gasteiger E">E Gasteiger</name>
</author>
<author>
<name sortKey="Huang, H" uniqKey="Huang H">H Huang</name>
</author>
<author>
<name sortKey="Lopez, R" uniqKey="Lopez R">R Lopez</name>
</author>
<author>
<name sortKey="Magrane, M" uniqKey="Magrane M">M Magrane</name>
</author>
<author>
<name sortKey="Martin, Mj" uniqKey="Martin M">MJ Martin</name>
</author>
<author>
<name sortKey="Natale, Da" uniqKey="Natale D">DA Natale</name>
</author>
<author>
<name sortKey="O Onovan, C" uniqKey="O Onovan C">C O’Donovan</name>
</author>
<author>
<name sortKey="Redaschi, N" uniqKey="Redaschi N">N Redaschi</name>
</author>
<author>
<name sortKey="Yeh, Ls" uniqKey="Yeh L">LS Yeh</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Hegyi, H" uniqKey="Hegyi H">H Hegyi</name>
</author>
<author>
<name sortKey="Gerstein, M" uniqKey="Gerstein M">M Gerstein</name>
</author>
</analytic>
</biblStruct>
<biblStruct>
<analytic>
<author>
<name sortKey="Bao, E" uniqKey="Bao E">E Bao</name>
</author>
<author>
<name sortKey="Jiang, T" uniqKey="Jiang T">T Jiang</name>
</author>
<author>
<name sortKey="Kaloshian, I" uniqKey="Kaloshian I">I Kaloshian</name>
</author>
<author>
<name sortKey="Girke, T" uniqKey="Girke T">T Girke</name>
</author>
</analytic>
</biblStruct>
</listBibl>
</div1>
</back>
</TEI>
<affiliations>
<list>
<country>
<li>Allemagne</li>
<li>Suisse</li>
</country>
<region>
<li>Bade-Wurtemberg</li>
<li>Bavière</li>
<li>District de Haute-Bavière</li>
<li>District de Tübingen</li>
</region>
<settlement>
<li>Munich</li>
<li>Tübingen</li>
</settlement>
<orgName>
<li>Université Louis-et-Maximilien de Munich</li>
</orgName>
</list>
<tree>
<country name="Allemagne">
<region name="Bavière">
<name sortKey="Hauser, Maria" sort="Hauser, Maria" uniqKey="Hauser M" first="Maria" last="Hauser">Maria Hauser</name>
</region>
<name sortKey="Mayer, Christian E" sort="Mayer, Christian E" uniqKey="Mayer C" first="Christian E" last="Mayer">Christian E. Mayer</name>
<name sortKey="Soding, Johannes" sort="Soding, Johannes" uniqKey="Soding J" first="Johannes" last="Söding">Johannes Söding</name>
</country>
<country name="Suisse">
<noRegion>
<name sortKey="Mayer, Christian E" sort="Mayer, Christian E" uniqKey="Mayer C" first="Christian E" last="Mayer">Christian E. Mayer</name>
</noRegion>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Sante/explor/MersV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 001E65 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 001E65 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Sante
   |area=    MersV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     PMC:3843501
   |texte=   kClust: fast and sensitive clustering of large protein sequence databases
}}

Pour générer des pages wiki

HfdIndexSelect -h $EXPLOR_AREA/Data/Main/Exploration/RBID.i   -Sk "pubmed:23945046" \
       | HfdSelect -Kh $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd   \
       | NlmPubMed2Wicri -a MersV1 

Wicri

This area was generated with Dilib version V0.6.33.
Data generation: Mon Apr 20 23:26:43 2020. Site generation: Sat Mar 27 09:06:09 2021